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Abstract 

This paper shows that previously reported gener- 
ation algorithms run into problems when dealing 
with f-structure representations. A generation 
algorithm that is suitable for this type of rep- 
resentations is presented: the Semantic Kernel 
Generation (SKG) algorithm. The SKG method 
has the same processing strategy as the Seman- 
tic Head Driven generation (SHDG) algorithm 
and relies on the assumption that it is possible 
to compute the Semantic Kernel (SK) and non 
Semantic Kernel (Non-SK) information for each 
input structure. 



1 Introduction 

In this paper we take up the problem of (tacticaly) 
generating a string from an f-structure representation; 
we will show that generation algorithms that have al- 
ready been described in the literature are not directly 
applicable to this type of representations, and we will 
propose a new generation algorithm: the Semantic 
Kernel Generation (SKG) algorithm. We will also 
show that since the SKG generator is guided by the 
semantic-head and syntactic-head relations, it can be 
seen both as a variant of the semantic head driven gen- 
eration algorithm) (SHDG) (Shieber et al. 1990) and 
the syntactic-head driven generation algorithm (Syn- 
HDG) (Konig 1994). A former version of this work 
was originally reported in (Nicolov et al. 1996), which 
also describes alternative generation algorithms for f- 
structure representations. 

2 The Semantic Kernel Generation 
Algorithm 

In order to show why former generators fail to generate 
from f-structures and why SKG works, we will assume 
the grammar fragment and lexical entries (originally 
described in (Nicolov et al. 1996)) which are shown 
in Figures 0, |^.^ As for the grammar fragment, note 
that rules la and lb introduce modifiers at sentence 
level, and rule 3 introduces modifiers at vp level. iJwZe 
2 combines the subject with the vp.Rule 4 deals with 
the complements of a vp. 

The analysis of the sentence The little prolog pro- 
gram generated the complex sentence quickly is shown 
in Figure ||.^ Note that input semantics represents the 
deep predicate argument structure of sentences for the 
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Figure 1: Lexical entries for sentence and program 



generator; modifiers are contained in set-valued fea- 
ture "mod". 

First we look at the results we obtain after applying 
former generation methods on f-structure representa- 
tions and then we describe the SKG algorithm. 

For expository purposes we will use the graphical 
notation used in (Konig 1994) to describe the gen- 
eration algorithms. Following (Konig 1994): we will 
assume that the syntax-semantics-relation for a given 
grammar is stated by pairs of trees. The left tree shows 
a local syntactic dependency, whereas the right tree 
defines a local semantic dependency. We also assume 
that there is a one-to-one mapping from the nonter- 
minal leaf nodes of the syntactic tree to the leaf nodes 
of the local semantic tree. Note that this is only a 
graphical notation for the rule-to-rule hypothesis, i.e., 
the fact that in the grammar each syntactic rule is re- 
lated to a semantic analysis rule. An example is given 
below: 



s VP(NP) 
np^p NP^P 



(1) 



^For the grammar fragment only the relevant semantic 
information is shown. 

■^The example is due to Nicolas Nicolov. 



The head-corner generator ((Noord 1993), a variant 
of SHDG) and SynHDG are (graphically) described in 
Figure ^ (taken directly from (Konig 1994)). The lex 
rule is the prediction step of the algorithm, i.e. it re- 
stricts the selection of lexical entries to those that can 
be linked with the local goal (visualized by a dotted 
line). Thehc_complete rule is the bottom-up step which 
selects a rule for which xh is the syntactic head and 
Xfi is the semantic head. As a result, it also predicts 
the head's sisters, which have to be expanded recur- 
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Figure 2: Grammar Fragment. Only semantic information is shown 
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Figure 3: Input semantics 



sively (top-down prediction). The difference between 
SHDG and SynHDG is the hnk relation for semantic 
structures: in (Noord 1993) the semantic-based link 
relation is defined as foUows: 



link{X,X,) if 



(2) 



X and Xi are identical. If we represent semantics 
using first order terms, then we only have to check 
whether X and Xi unify. As for the SynHDG algo- 
rithm, the semantic-based link relation is defined as 
follows (Konig 1994): 

link{X,X,) if (3) 

Xi is a substructure oi X . In practical terms, Xi 
is an element of the bag of semantic keywords that 
constitute X (Konig 1995). 

2.1 The direct application of former 

generation procedures to f-structure 
representations 

We will illustrate the problems SHDG (more specif- 
ically, a variant of it: the head-corner generator de- 
scribed in (Noord 1993)) runs into when dealing with 
f-structures by following its application to the input 
semantics given below (which corresponds to the np 
the complex sentence): 



cat: 
sem: 



np (4) 

rel: sentence 

def: + 

mod: [complex] 

According to the algorithm described in Figure g and 
due to the syntactic-head and semantic-head link re- 
lation^ the lex rule can onlv be applied to the lexical 
entry for sentence (Figure |lj). However, since the se- 
mantic link relation is defined in terms of unification^ 
applying rule lex leads to a new semantic goal which 
is identical to the input semantics. Next we need to 
apply to apply the hc_complete rule; rule 7 is the only 
possible candidate. After applying it, our current goal 
is the following: 

cat: n2 (5) 

sem: rel: sentence 

def: + 

mod: [complex] 

At this point, a new hc_complete step needs to be 
taken. Now we have two candidates: rules 6 and 8. 
If we select rule 6, and after generating recursively the 
determiner, we end up having generated only part of 
the sentence: the sentence. Rule 8, in its turn, can be 
always selected; consequently, we could end up having 
semantic goals that would look like that: 

(6) 
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mod: [X,... [complex] 

In other words, the generator would loop and would 
not terminate. 



The problems discussed above lead us to the follow- 
ing conclusion: the SHDG generator is neither com- 
plete nor coherent. These issues also arise with first or- 
der terms (see discussion in (Shieber et al. 1990)); the 
problem here is that we lack the definition of grounded 
feature structures. 

2.2 Semantic Kernel Generator 

The main assumption behind the SKG algorithm is 
that the generator is capable of distinguishing between 
the following types of semantic information within in- 
put structures: 

• Semantic Kernel (SK) Information: Seman- 
tic structure completely which is predictible from 
the lexicon (i.e, there is at least one lexical entry 
which subsumes this structure). 

• Non Semantic Kernel (Non-SK) Informa- 
tion: Semantic structure which is not predictible 
from any lexical entry (typically, lists). In our 
grammar, modifiers are represented as a list. This 
list is Non-SK information. 

Similarly, the generator is given the following informa- 
tion with respect to the types of rules: 

• SK Rules: Rules which do not add Non-SK in- 
formation. 

• Non-SK Rules: Rules which add Non-SK infor- 
mation. Rules 1,3,8 in our grammar. 

The hypothesis behind this classification is that 
of structural predictibility: SK information comes 
from the lexicon (i.e, SK information can be seen as 
grounded feature terms), and non SK information is 
introduced through rules. In other words, the genera- 
tor knows whether each type of input structure comes 
from a lexical entry or whether it has been constructed 
from a (non SK) rule. Thus, the restrictedness of the 
algorithm is due to the fact that it operates under the 
assumption that we can recursively decompose each 
input structure into SK and Non-SK information. 

A graphical version of the SKG algorithm is given 
in Figure ^. 

In sum, this is the information the generator needs 
to know about the grammar: 

• link relation (head relation). 

• The SK and Non-SK substructures of a given se- 
mantic representation. 

• The distinction between SK rules and Non-SK 
rules. 

• The syntactic goals to generate SK and Non-SK 
information. 

• The syntactic goal we obtain after combining SK 
and Non-SK information. 

In order to show how the SKG algorithm works we 
will follow its application to the input semantics for 
the complex sentence given in example |^. For this 
input semantics, we have two SK structures: 



rel: sentence 
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(7) 
(8) 



all leaves are labeled with terminals and the tree does not contain any dotted lines (global-success) 
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Figure 4: Head-Corner Generator (G grammar description; syntactic category; Xi semantic representation) 
(Taken from (Konig 1994)) 



and one nonSK structure: 

mods: [complex] 



(9) 



The lex rule cannot be applied because of the SK struc- 
ture condition: input semantics has nonSK informa- 
tion. Thus, we can only apply the second hc-corner 
step. This forces us to start from rule 6 and generate 
(top-down) the following goals: 

det 1 (10) 



cat: 

sem: def: 
cat: n2 

sem: rel: sentence 
mods: [complex] 



(11) 



Note that the generator has been told about the rela- 
tion between nonSK information and SK and nonSK 
rules, therefore it knows where the modifiers come 
from. The determiners generation is reduced to ap- 
plying the lex rule. To generate the n2 goal (example 
[ll|) we proceed as before; this goal has nonSK informa- 
tion, so the generator starts from rule 8 and generates 
(top-down) the appropriate subgoals: 

n2 1 (12) 



cat: 

sem: rel: sentence 

cat: adj 

sem: rel: complex 



(13) 



The generation of the subgoals above is straightfor- 
ward, since they do not contain nonSK information 
and the Zexrule can be applied without problems. The 
application of the hc_corner step to each of the sub- 
goals deserves further comments, since we are runing 
the risk of having termination problems. For example, 
once we have applied the lex rule and the hc-corner 
step for sentence we obtain the goal in example |l^. 
One may wonder whether we could apply rule 8 again 
and end up having subgoals like the following: 



cat: 
sem: 



n2 
rel: 
mods: 



sentence 
[X,...] 



(14) 



This situation is not possible, since if we only have SK 
information only SK rules can be applied, and rule 8 is 
a nonSK rule (see conditions on hc-complete step (1) 
in Figure ||). 

Another example will clarify how the generator 
works. Assuming we want to generate a string for the 
semantic representation in Figure the following sen- 
tences should be generated according to the grammar:"^ 

• the {little,prolog} program generated the complex 
sentence quickly. 



In this example we will only concentrate in the genera- 
tion of qmckly at sentence or vp level; the rest of modifiers 
{complex, little, prolog) for the np level would be generated 
in an identical manner. 
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Figure 5: Semantic Kernel Generator (G grammar description; Xi syntactic category; Xi semantic representa- 
tion; SK{X) is true if X contains only SK information; Gsk grammar description with only SK rules; G^sK 
grammar description with only nonSK rules; sk{X,Xh) is true if X^ is SK information of X (Adapted from 
(Konig 1994)) 



• quickly the {little, prolog} program generated the 

complex sentence. 

• the {little, prolog} program quickly generated the 
complex sentence. 

The generator detects that the semantic representa- 
tion consists of both a SK structure and a Non-SK 
structure ("quickly"). Thus, according to the gram- 
mar, there are several ways of generating these struc- 
tures given the original goal (which is a sentence). The 
generator tries the following combinations: 

• It generates a "S" string for the SK information 
and a "adv" string for the Non-SK information. 

Both strings can be combined in two ways (which 
corresponds to rules Rla and Rib). This gives us 
two of the possibilities. 

• It generates a "VP" string for the SK informa- 
tion and "adv" string for the Non-SK information 
(this corresponds to rule R3). After generating 
these strings, and according to rule R2, we link 
the "VP" to "S" 

3 Discussion 

The SKG generator proceeds top-down, generating the 
appropriate subgoals, when it finds nonSK informa- 
tion. It proceeds bottom-up when lexical prediction 
can be made (when there is only SK information), and 
the head- corner step for SK information can only be 
performed using SK rules, thus avoiding termination 
problems. 

The way our generator works and the distinction 
between SK and Non-SK information resembles the 

definition of the restrictor operator and the treatment 
of modifiers given in (Wedekind & Kaplan 1993). The 
difference is obviously the context of application: In 
(Wedekind & Kaplan 1993) the main interest is struc- 
tural misalignment between f-structure and semantic 
representations, whereas our concern is string genera- 
tion from f-structure representations. 

4 Implementation 

Our framework has been the Sicstus-Prolog version of 
the CUF language (Dorre & Dorna 1993) plus a layer 

on top of it which implements the grammar formalism, 
the (left-corner) parser and the SKG generator. 

5 Conclusions 

We have shown that for f-structure representations 
previously proposed generation algorithms run into 
problems, and therefore, we have presented an alter- 
native: the SKG algorithm. The main assumptions 
behind the algorithm is the following: for each seman- 
tic input it is possible to compute its SK and Non-SK 
information. We have also shown that our approach 
resembles the definition of the restrictor operator and 
the treatment of modifiers given in (Wedekind & Ka- 
plan 1993) to deal with structural misaligments be- 
tween f-structure and semantic representations. 
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